SASH: A Self-Adaptive Histogram Set for Dynamically Changing Workloads
نویسندگان
چکیده
Most RDBMSs maintain a set of histograms for estimating the selectivities of given queries. These selectivities are typically used for costbased query optimization. While the problem of building an accurate histogram for a given attribute or attribute set has been well-studied, little attention has been given to the problem of building and tuning a set of histograms collectively for multidimensional queries in a selfmanaged manner based only on query feedback. In this paper, we present SASH, a Self-Adaptive Set of Histograms that addresses the problem of building and maintaining a set of histograms. SASH uses a novel two-phase method to automatically build and maintain itself using query feedback information only. In the online tuning phase, the current set of histograms is tuned in response to the estimation error of each query in an online manner. In the restructuring phase, a new and more accurate set of histograms replaces the current set of histograms. The new set of histograms (attribute sets and memory distribution) is found using information from a batch of query feedback. We present experimental results that show the effectiveness and accuracy of our approach.
منابع مشابه
A Self-Adaptive Model to Improve Average Response Time of Multiple-Event Filtering for Pub/Sub System
Publish/subscribe system captures the dynamic aspect of the specified information by notifying users of interesting events as soon as possible. Fast response time is important for event filtering which requires multiple step processing and is also one of important factors to provide good service for subscribers. Generally the event arrival rate is time varying and unpredictable. It is very poss...
متن کاملDynamic Weighted Fusion of Adaptive Classifier Ensembles Based on Changing Data Streams
Adapting classification systems according to new input data streams raises several challenges in changing environments. Although several adaptive ensemble-based strategies have been proposed to preserve previously-acquired knowledge and reduce knowledge corruption, the fusion of multiple classifiers trained to represent different concepts can increase the uncertainty in prediction level, since ...
متن کاملThe Time Adaptive Self Organizing Map for Distribution Estimation
The feature map represented by the set of weight vectors of the basic SOM (Self-Organizing Map) provides a good approximation to the input space from which the sample vectors come. But the timedecreasing learning rate and neighborhood function of the basic SOM algorithm reduce its capability to adapt weights for a varied environment. In dealing with non-stationary input distributions and changi...
متن کاملFlexSplit: A Workload-Aware, Adaptive Load Balancing Strategy for Media Cluster
A number of technology and workload trends motivate us to consider a new request distribution and load balancing strategy for streaming media cluster. First, in emerging media workloads, a significant portion of the content is short and encoded at low bit rates. Additionally, media workloads display a strong temporal and spatial locality. This makes modern servers with gigabytes of main memory ...
متن کاملMachine Learning for On-Line Hardware Reconfiguration
As computer systems continue to increase in complexity, the need for AI-based solutions is becoming more urgent. For example, high-end servers that can be partitioned into logical subsystems and repartitioned on the fly are now becoming available. This development raises the possibility of reconfiguring distributed systems online to optimize for dynamically changing workloads. However, it also ...
متن کامل